Towards Development of Clustering Applications for Large-Scale Comparative Genotyping and Kinship Analysis Using Y-Short Tandem Repeats
نویسندگان
چکیده
Y-chromosome short tandem repeats (Y-STRs) are genetic markers with practical applications in human identification. However, where mass identification is required (e.g., in the aftermath of disasters with significant fatalities), the efficiency of the process could be improved with new statistical approaches. Clustering applications are relatively new tools for large-scale comparative genotyping, and the k-Approximate Modal Haplotype (k-AMH), an efficient algorithm for clustering large-scale Y-STR data, represents a promising method for developing these tools. In this study we improved the k-AMH and produced three new algorithms: the Nk-AMH I (including a new initial cluster center selection), the Nk-AMH II (including a new dominant weighting value), and the Nk-AMH III (combining I and II). The Nk-AMH III was the superior algorithm, with mean clustering accuracy that increased in four out of six datasets and remained at 100% in the other two. Additionally, the Nk-AMH III achieved a 2% higher overall mean clustering accuracy score than the k-AMH, as well as optimal accuracy for all datasets (0.84-1.00). With inclusion of the two new methods, the Nk-AMH III produced an optimal solution for clustering Y-STR data; thus, the algorithm has potential for further development towards fully automatic clustering of any large-scale genotypic data.
منابع مشابه
VNTR9 and VNTR10, two newly-found variable-number tandem repeat loci useful in MLVA genotyping of Bordetella pertussis
Background & Aims: Bordetella pertussis, the causative agent of whooping cough, continues to infect human hosts even in those populations where infants and children are routinely vaccinated. Causes of pertussis epidemiology are not fully identified unless strains of the pathogen are characterized by molecular means. Golbally, Multi Locus Variable Number of Tandem Repeats analysis (MLVA) has pro...
متن کامل[Kinship determination using DNA markers].
BACKGROUND Autosomal and Y chromosome short tandem repeats (STRs) and mitochondrial DNA polymorphisms are the most commonly used molecular tools for determination of kinship. AIM To report a revision of 1,120 kinship cases (paternity and others) analyzed in our laboratory. MATERIAL AND METHODS Revision of all kinship cases analyzed between years 2001-2006. Autosomal and Y chromosome STRs an...
متن کاملGenetic analysis of two STR loci (VWA and TPOX) in the Iranian province of Khuzestan
Objective(s): Short tandem repeat (STR) loci are the most informative DNA genetic markers for attempting to individualize biological material for application in paternity and forensic cases. Materials and Methods: Blood samples were collected and the total genomic DNA was extracted. The DNA samples were used for genotyping VWA and TPOX STR loci using PCR and polyacrylamide gel electrophoresis. ...
متن کاملSegmental Duplications as a Complement Strategy to Short Tandem Repeats in the Prenatal Diagnosis of Down Syndrome
Background: Quantitative fluorescence-polymerase chain reaction (QF-PCR) is an inexpensive and accurate method for the prenatal diagnosis of aneuploidies that applies short tandem repeats (STRs) as a chromosome-specific marker. Despite its apparent advantages, QF-PCR is not applicable in all cases due to the presence of uninformative STRs. This study was carried out to investigate the efficienc...
متن کاملGenome-Wide Development and Use of Microsatellite Markers for Large-Scale Genotyping Applications in Foxtail Millet [Setaria italica (L.)]
The availability of well-validated informative co-dominant microsatellite markers and saturated genetic linkage map has been limited in foxtail millet (Setaria italica L.). In view of this, we conducted a genome-wide analysis and identified 28 342 microsatellite repeat-motifs spanning 405.3 Mb of foxtail millet genome. The trinucleotide repeats (∼48%) was prevalent when compared with dinucleoti...
متن کامل